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The invention relates to a digital transmission 



f 



system comprising a transmitter and a receiver^ for 
transmitting a wide-band digital signal^o^-a-spe^rf^c- 
+1 sample frequency F s , for example a digital audio signal, ^ 
transmission medium, and for receiving said signal, > 



via a 



0 y^traagmittei h a . i n y a n in put tormina fe^ee^-iv^ng-^e- 
*g ^^ - band -d^fe^r-^gw il, which input -^erm^na^-xs-^eupjred-^ 
te- an input of n signal smi r co which f o r m s — pa r L o£ ~^ fae ~" 
fegara mitter and which i s-^ofts feruct e d to - g^graiJEr-a-sgc oTTd- 

10 dig-iLal sign al and avvpty-ee^^ outpuL ' ]£3f*~ 

t second digital sign^fccomprises consecutive frames, A each 

frame comprising a plurality of information packets, each 
information packet comprising N bits, N being larger than , 
l f ^ N receiver N ^Sp^s^g a decoder having an input for 
£ 15 receiving the second digital signal, wfei^decoder has an 

output coupled to a^ output terminal to supply the wide- V. 
£ band digital signal The invention also relates to a 

transmitter and a receiver for use in the transmission 
system, to a transmitter in the form of a device for 
2 0 recording the second digital signal in a track on a record 
carrier, to a record carrier obtained by means of the 
transmitter, and to a receiver in the form of a device for 
reading the second digital signal from the track on the 
record carrier^! transmission system of the type defined 
25 in the opening sentence is known from the article 

"The Critical Band Coder - Digital Encoding of Speech 
signals based on the Percentual requirements of the 
0 Auditory System- by M.E. Krasner in Proc. IEEE ICASSP 80, 
f> F \4 Vol.1, pp. 327-331, April 9-11, 1980. This article relates 

30 to a transmission system in which the transmitter employs a 
subband coding system and the receiver employs a 
corresponding subband decoding system, but the invention 1S 
not limited to such a coding system, as will become 
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apparent hereinafter. 

In the system known from said publication the 
speech signal band is divided into a plurality of subbands 
whose bandwidth approximately corresponds to the bandwidths 
5 of the critical bands of the human ear in the respective 
frequency ranges (cf. Fig. 2 in the article of Krasner) . 
This division has been selected because on the ground of 
psycho-acoustic experiments it is foreseeable that the 
quantisation noise in such a subband will be masked to an 
10 optimum extent by the signals in this subband if in the 

quantisation allowance is made for the noise-masking curve 
of the human ear (this curve gives the threshold value for 
noise masking in a critical band by a single tone in the 
centre of the critical band, cf. Fig. 3 in the article by 
15 Krasner) . 

In the case of a high-quality digital music 
signal, which in conformity with the Compact Disc Standard 
is represented by 16 bits per signal sample in the case of 
^ a sample frequency of 1/T =44.1 kHz, it is found that with 

2 0 a suitably selected bandwidth and a suitably selected 

quantisation for the respective subbands the use of this 
known subband-coding system yields quantised output signals 
of the coder which can be represented by an average number 
of approximately 2.5 bits per signal sample, the quality of 
25 the replica of the music signal not differing perceptibly 

from that of the original music signal in substantially all 
passages of substantially all kinds of music signals. 

The subbands need not necessarily correspond to the 
bandwidths of the critical bands of the human ear. 

3 0 Alternatively, the subbands may have other bandwidths, for 

example they may all have the same bandwidth, provided that 
allowance is made for this in determining the masking 

rj threshold. A^AAinATW^t^ @f 

^0^^ It is an object hi the invention to provide a 

T 35 number of steps for the transmission system, in particular 
a very specific choice for the format with which the 
digital wide-band signal, after conversion into the second 
digital signal, can be transmitted via the transmission 
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medium, in such away that a flexible and highly versatile 
transmission system is obtained. This is to be understood 
to mean that the transmitter should be capable of 
converting wide-band digital signals of different formats 
(which formats differ inter alia in respect of the sample 
frequency F s of the wide-band digital signal, which may 
have different values such as 3 2 kHz, 44.1 kHz and 48 kHz, 
as laid down in the digital audio interface standard of the 
AES and the EBU) into the second digital signal*? 7 Similarly , 
the receiver should be capable of deriving a wide-band 
signal of the correct format from said second digital 
£ignnlT iJ2 T o t hi ~. pitI t b° A ramstn^ L . L ui i uyu Lu m ' in accordance 
with ttie inventionis^£L.c haraot o riao d i n-tfrat if P in the 
formula 

BR n s 
p = — x — 

N F s 

is an integer, where » 
BR is the bit rate of the second digital signalt-and n s is 
the.numb^f of samples of the wideband digital signal whose 
corresponding information, which belongs to the second 
digital signal, is included in one frame of the second 
digital signal,' the number conformation packets^ B > in one 
fr Le is P^ffi^f?^ '^fiF^^WP^«^ 
of information packets in a numbei^of the frames is P«, 
P' being the next lower tffie^^ffjF 1 ^ P ' and the number 
of information packets in the A other Trames is equal to 
r'llugfuas to °>f^-ny -r-nmpl-y with the requirement that the 
averageframe rate of the second digital signal should be 
substantially equal to F^/n£ and that a frame should 
comprise at least a first frame portion including the 
synchronising informational^ purpose of dividing the 
frames into B information packets is that for a wide-band 
digital signal of an arbitrary sample frequency Fg the 
average frame rate of the second digital signal 
transmitted by the transmitter is now such that the/ 
duration of a frame in the second digital signal 
\ corresponds to the duration occupied by n$ samples of the 
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wide-band signal. Moreover, this enables the 
synchronisation to be maintained on an information-packet 
basis, which is simpler and more reliable than maintaining 
the synchronisation on a bit basis. Thus, in those cases 
where P is not an integer, the transmitter is capable, at 
instants at which this possible and also necessary, to 
provide a frame with P'+l instead of P' information 
blocks, so that the average frame rate of the second 
£ i{ digital signal can be maintained equal to Fg/ n §- Since in 
10 this case the spacing between the synchronising information 
(synchronising signals or synchronising words) included in 
the first frame portion of succeeding frames is also an 
integral multiple of the length of an information packet it 
remains possible to maintain the synchronisation on an 
t 15 information packet basis^Pref erably , the first frame 

portion further contains information related to the number 
of information packets in a frame. In a frame comprising B 
4^ information packets this information may be equal to the 
4? value B. This means that this information corresponds to P' 
20^ for frames comprising P' information packets and to P'+l 
£ for frames comprising P'+l information packets. Another 

4p possibility is that this information corresponds to P' for 
U all frames, regardless of whether a frame comprises P' or 
4*^ P'+l information packets. The additionally inserted 
25 L (P«+i)th information packet may comprise for example merely 
"zeros". In that case this information packet does not 
contain any useful information. Of course, the additional 
information packet may also be filled with useful 
i informationrThe first frame portion may further comprise 

3 0 system information. This may include the sample frequency 
j[\ F s of the wide-band digital signal applied to the 

transmitter, copy-protection codes, the type of wide-band 
digital signal applied to the transmitter, such as a 
stereo-audio signal or a mono-audio signal, or a digital 
35 signal comprising two substantially independent audio 
signals. However, other system information is also 
possible, as will become apparent hereinafter. Including 
the system information makes it possible for the receiver 
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to be also flexible and enables the received second digital 
signal to be correctly reconverted into the wide-band 
digital signal. The second and the jthird frame portions of 
a frame contain signal information. The transmitter may 
comprise a coder comprising signal-splitting means 
responsive to the wide-band digital signal to generate a 
second digital signal in the form of a number of M 
subsignals, M being larger than 1, and comprising means for 
quantising the respective sub signals. For this purpose an 
arbitrary transform coding, such as the fast Fourier 
transform (FFT) may be used. In that case the transmission 
system is characterized in that the second frame portion of 
a frame contains allocation information which, for at least 
a number of sub signals, indicates the number of bits 
representing the samples of the quantised subsignals 
derived from said subsignals, and in that the third frame 
portion contains the samples of at least said quantised 
subsignals (if present). At the receiving end it is then 
necessary to apply an inverse transform coding, for example 
an inverse Fourier transform (IFFT) , to recover the wide- 
band digital signal? ^hp trana m^ss^o ii b y fa Len t, in which — the- 
signal-splitting means take the form of analysis-filter 
means responsive to the wide-band digita^signal to 
generate a number of M subband signals ,--w*w ! eh A analysis- 
filter means divide the signal band of the wide-band 
digital signal, using a sample-frequency reduction, into 
successive subbands having band numbers m increasing with 
the frequency, and w*-^whieh- the quantisation means are 
adapted to quantise the respective subband signals block by 
b loc k, ic a g y - itrm f^r 1 " Yi " ? e o d -rng- as aescriBed- 

aieve^ Such a transmission system is characterized further 
in that for at least a number of the subband signals the 
allocation information in the second frame portion of a 
frame specifies the number of bits representing the samples 
of the quantised subband signals derived from said subband 
signals and in that the third frame portion contains the 
samples of at least said quantised subband signals (if 
present^ This means in fact that the allocation information 
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♦ is inserted in a frame before the samples. This allocation 
information is needed to enable the continuous serial bit 
stream of the samples in the third frame portion to be 
subdivided into the various individual samp^s of the 
5 correct number of bits at the receiving end. The allocation 
information may require that all samples are represented by 
a fixed number of bits per subband per frame. This is 
referred to as a transmitter based on fixed or static bit 
allocation. The allocation information may also imply that 
10 a number of bits variable in time is used for the samples 

in a subband. This is referred to as a transmitter based on 
the system of adaptive or dynamic bit allocation. Fixed and 
adaptive bit allocation are described inter alia in the 
publication "Low bit-rate coding of high quality audio 
15 signals. An introduction to the MAS CAM system" by 

fa G. Theile et al, EBU Technical Review, No. 23 0 
fiF^ (August 1988) .Inserting the allocation information in a 

frame before the samples in a frame has the advantage that 
at the receiving end a simpler decoding becomes possible, 
20 which can be carried out in real time and which produces 

only a slight signal delay. As a result of this sequence it 
is no longer necessary to first store all the information 
in the third frame portion in a memory in the receiver. 
Upon arrival of the second digital signal the allocation 

2 5 information is stored in a memory in the receiver. 

Information content of the allocation information is much 
smaller than the information content of the samples in the 
third frame portion, so that a substantially smaller store 
capacity is needed than in the case that all the samples 

3 0 would have to be stored in the receiver. Immediately upon 

arrival of the serial data stream of the samples in the 
third frame portion this data stream can be divided into 
the various samples having the number of bits specified by 
the allocation information, so that no previous storage of 
3 5 the signal information is necessary. The allocation 

information for all the subbands can be included in a 
frame. However, this is not necessary, as will become 
apparent hereinafter. 
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The transmission system may be characterized 
further in that in addition the third frame portion 
includes information related to scale factors, a scale 
factor being associated with at least one of the quantised 
5 subband signals contained in the third frame portion, and 
in that the scale factor information is included in the 
third frame portion before the quantised subband signals. 
The samples can be coded in the transmitter without being 
normalised, i.e. without the amplitudes of a block of 
10 samples in a subband having been divided by the amplitude 
of the sample having the largest amplitude in this block, 
in that case no scale factors have to be transmitted. If 
£ the samples are normalised during coding, scale factor 

information has to be transmitted to provide a measure of 
15 said largest amplitude. If in this case the scale factor 
information is also inserted in the third frame portion 
> before the samples it is possible that during reception 4*e- 

the scale factors to be derived from said scale 
information are first stored in a memory and the samples 
20 are multiplied immediately upon arrival, i^. without a 
time delay, by the inverse values of said scale factors. 
£ ^The scale factor information may be constituted by the 

scale factors themselves. It is obvious that a scale factor 
as inserted in the third frame portion may also be the 
25 inverse of the amplitude of the largest sample in a block, 
so that in the receiver it is not necessary to determine 
the inverse value and consequently decoding can be faster. 
Alternatively, the values of the scale factors may be 
encoded prior to insertion in the third frame portion as 
30 scale factor information and subsequent transmission. 

Moreover, it is evident that if after quantisation in the 
transmitter the subband signal in a subband is zero, which 
obviously will be apparent from the allocation information 
for the subband, no scale factor information for this 
£ 35 subband has to be transmitted. The transmission system, in 
which the receiver comprises a decoder comprising 
synthesis-filter means responsive to the respective 
quantised subband signals to construct a replica of the 
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; wide-band digital signal, wh±cfc A synthesis-filter means 

combine the subbands applying sample-frequency increase to 
form the signal band of the wide-band digital signal, may 
be characterized in that the samples of the subband signals 
5 (if present) are inserted in the third frame portion in a 
sequence corresponding to the sequence in which said 
samples are applied to the synthesis-filter means upon 
reception in the receiver. Inserting the samples in the 
third frame portion in the same sequence as that in which 
10 they are applied to the synthesis-filter means in the 

receiver also results in fast decoding, which again does 
not require additional storage of the samples in the 
receiver before they can be further processed. 
Consequently, the storage capacity required in the receiver 
15 can be limited substantially to the storage capacity needed 
for the storage of the system information, the allocation 
information and, if applicable, the scale factor 
information. Moreover, a limited signal delay is produced, 
which is mainly the result of the signal processing 
20 performed upon the samples^The allocation information for 
the various quantised subband signals is suitably inserted 
in the second frame portion in the same sequence as that in 
which the samples of the subband signals are included in 
the third frame portion. The same applies to the sequence 
25 of the scale factors. If desired, the frames may also be 
divided into four portions, the first, the second and the 
third frame portion being as described hereinbefore. 
The last (fourth) frame portion in the frame may then 
contain error-detection and/or error-correction 
3 0 information. Upon reception of this information in the 

receiver it is possible to apply a correction for errors 
produced in the second digital signal during transmission. 
As- already stated, the wide-band digital signal may be a 
monophonic signal. Alternatively, the wide-band digital 
35 signal may be a stereo audio signal made up of a first 
(left) and a second (right) channel component. If the 
transmission system is based on a subband-coding system the 
transmitter will supply subband signals each comprising a 

O 
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first and a second subband-signal component, which after 
quantisation in the quantisation means are converted to 
form first and second quantised subband signal components. 
In this case the frames should also include allocation 
5 information and scale-factor information (if the samples 

have been scaled in the transmitter) . The sequence is also 
t important here. S^eh-a-^tecmsi n ibs iun s ystem io t hcrofo^ e 

de^ifte^-A n the appem ded-"€jr€Ojaxs^ia--^e-^^ It is obvious that 
the system can be extended to handle a wide-band digital 
10 signal comprising more than two signal components. 

The inventive steps may be applied to <*£1£P*\ 
£ transmission systems, for example systems for^tsSe^*^ 

transmission of digital audio signals (digital audio 
£ broadcast) via^tho - othor^ However, other uses are also 
15 conceivable. An example of this is a transmission via 

optical or magnetic media. Optical-media transmissions may 
be, for example, transmissions via glass fibres or by means 
of optical discs or tapes. Magnetic-media transmissions are 
possible, for example, by means of a magnetic disc or a 
20 magnetic tape. The second digital signal is then stored in 
the format as proposed by the invention in one or more 
tracks of a record carrier, such as an optical or magnetic 
disc or a magnetic tape. The versatility and flexibility of 
the transmission system thus resides in the special format 
25 with which the information in the form of the second 

digital signal is transmitted, for example via a record 
carrier. This is combined with the special construction of 
the transmitter which is capable of generating this 
special format for various types of input signals. 
3 0 The transmitter generates the system information required 
for every type of signal and inserts this information in 
the data stream to be transmitted. At the receiving end 
t this is achieved by means of a specific receiver, which 
extracts said system information from the data stream and 
3 5 employs it for a correct decoding. 

The information packets a^e- then constitute a kind 
of fictitious units, which are used to define the length of 
a frame. This means that they need not be explicitly 
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discernible in the information stream of the second digital 
signal. Moreover, the relationship of the information 
packets with the existing digital audio interface standard 
is as defined in the IEC standard no. 958. This standard as 
5 normally applied to consumer products defines frames 

containing one sample of both the left-hand and the right- 
hand channel of a stereo signal. These samples are 
3- represented by means of 16-bit two's complement words. If N 
^ = 32 is selected, one frame of this digital audio interface 
10 standard can transmit exactly one information packet of the 
second digital signal. In the digital audio interface 
standard the frame rate is equal to the sample rate. For 
the present purpose the frame rate should be selected to be 
equal to BR/N. This enables the present ICs employed in 
15 standard digital audio interface equipment to be used. 

Embodiments of the invention will now be described 
in^more 'detail, by way of example, with reference to the 

the transmitter and made up of frames, each frame bexng 
composed of information packets, 

Fig. 2/ gives the structure of a frame, 
Fig. 3* shows the structure of the first frame 

portion of a frame, 

Fig. 4^gives an example of the transmission system, 
Fig. 5«is a table specifying the number of 
information packets B in a frame for specific values of the 
bit rate BR and the sample frequency F s , 

Fig. 6^ gives the number of frames in a padding 
sequence and a number of frames thereof comprising an 

additional information packet (dummy slot) for a number of 

\ 

values of the bit rate BR, 

Fig. 7 represents the system information included 

in the first frame portion of a frame, 

Fig. 8 ^illustrates the distribution of the digital 
information about the various (two) channels for a number 

of modes, | 

Fig. 9 illustrates the significance of the 
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allocation information as, inserted in the second frame 

portion, / * 

Figs. 10 and 11 illustrate the sequence in which 
the allocation information is stored in the second frame 
portion for two formats, format A and format B 
respectively. jjf 

Fig. 12 //shows an example of a receiver, 
Fig. 13* shows a transmitter in the form of a device 
for recording the second digital signal on a magnetic 
record carrier/ 

Fig. 14. shows the receiver in the form of a device 
for reproducing th^ second digital signal from a magnetic 
record carrier L ^^ 

shovyrsome further possibilities of 
including the scale factors and samples in the third frame 
portion of a frame, 

Fig. 16 fehows a further modification of the 

transmitter, ^ 

Fig. 17 shows another structure of the first frame 

portion of a frame, 

Fig. 18 shows the system information included in 
the first frame porti of illustrated in Fig. 17, 

Figs. 19 ' and 20 show in more detail the information 
in the first frame/ portion illustrated in Fig. 17, 

Ficrs. 21 f and 22 illustrate the sequence in which 
the allocation information is accommodated in the second 
frame portion associated with the first frame portion of 

Fig. 17, / 

Fig. 23 gfves the structure of a frame filled with 

an additional signal, 

Fig. 24 illustrates how the scale factors are 

derived, / 

Fig. 25 illustrates the quantisation of the scaled 
samples to form^-bit digital representations, and 

Fig. 2 6* illustrates the dequantisation of the q-bit 
digital represe^ionj^ 

,M ~ Fig. 1 s&o^ddagOT the secbnd digital 

signal as generated by the transmitter and transmitted via 
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the transmission medium. The second digital signal takes 
the form of the serial digital data stream. The second 
digital signal comprises frames, two such frames, i.e. the 
frame j and the frame j+1, being given in Fig. la. 
The frames, such as the frame j, comprise a plurality of 
information packets IP1, IP2, IP3 , see Fig. lb. 

Each information packet, such as IP3, comprises N bits b 0 , 
f b x , b 2 , *>N-1' see Fi 9- lc - The number of information 

packets in a frame depends upon 
10 (a) the bit rate BR with which the second digital signal is 
transmitted via the transmission medium, 
(b) the number of bits N in an information packet, N being 
1 larger than 1, 

" (c) F s , being the sample frequency of the wide-band digital 

' 15 signal, and 

^ U (d) the number of samples n s of the wide-band digital 

signal, the information which corresponds thereto and 
r which after conversion in the transmitter ^^gU^ 

the second digital signal being included in one^&W^ 
2 0 in the following manner. 

The parameter P is computed in conformity with the 
following formula 
BR n s 



P = — x — 
25 N F s 

If this computation yields an integer for P the number of 
information packets B in a frame will be equal to P. If the 
computation does not result in an integer some frames will 
\° comprise P' information packets and the other frames will 
3 0 ^comprise P'+l information packets. P' is the next lower 

integer following P. The number of frames comprising P' and 
^5 P'+l information packets is obviously selected in such away 
7 that the average frame rate is equal to F#/n$- Hereinafter 

i\ it is assumed that N=32 and n s =384. The table in Fig. 5 
35 gives the number of information packets (slots) in one 

^\ frame for these values for N and n s and for four values of 
the bit rate BR and three values for the sample frequency 
£ \\ Fg. It is evident that for a sample frequency F $ equal to 
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44.1 kHz the parameter P is not an integer in all cases and 
that consequently a number of frames comprise 34 
information packets and the other frames comprise 35 
information packets (when BR is 128 kbit/s) . This is also 
illustrated in Fig. 2. Fig. 2 shows one frame. The frame 
comprises P' information packets IP1, IP2, IP P'. 

Sometimes a frame comprises P'+l information packets. 
This is achieved by assigning an additional information 
packet (dummy slot) to the frames of P' information 
packets. The second column of the table of Fig. 6 gives the 
number of frames in the padding sequence for a sample 
frequency of 44.1 kHz and the aforementioned four bit 
rates. The third column specifies those frames of said 
number of frames in the sequence which comprise P'+l 
information packets. By subtracting the numbers in the 
second and the third column from each other this yields the 
number of frames in the sequence comprising P' information 
packets. The (P'+l)th information packet then need not 
contain any information. The (P'+l)th information packet 
may then comprise for example only zeroes. It is obvious 
that the bit rate BR is not necessarily limited to the four 
values as given in the tables of Figs. 5 and 6. Other (for 
example intermediate) values are also possible. Fig. 2 
shows that a frame comprises three frame portions FD1, FD2 
and FD3 in this order. The first frame portion FD1 contains 
synchronising information and system information. 
The second frame portion FD2 contains allocation 
information. The third frame portion FD3 contains samples 
and, when applicable, scale factors of the second digital 
signal. For a further explanation it is necessary to first 
describe the operation of the transmitter in the 
transmission system in accordance with the invention. 

Fig. 4 shows diagrammatically the transmission 
system comprising a transmitter 1 having an input terminal 
2 for receiving the wide-band digital signal S BB , which may 
be for example a digital audio signal. In the case of an 
audio signal, this may be a mono signal or a stereo signal, 
in which case the digital signal comprises a first (left 
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channel) and a second (right channel) signal component. 
It is assumed that the transmitter comprises a coder for 
subband coding of the wide-band digital signal and that the 
receiver consequently comprises a subband decoder for 
5 recovering the wide-band digital signal. The transmitter 

comprises analysis filter means 3 responsive to the 
4 digital wide-band signal S BB to generate a plurality of M 
"ft subband signals S SB i to S SBM , which analysis filter means 
"(I divide the signal band of the wide-band signal S BB with 
10 sample-frequency reduction into successive subbands having 
9-^ band numbers M (1 < m < M) , which increase with the 

frequency. All these subbands may have the same bandwidth 
but, alternatively, the subbands may have different 
bandwidths. In that case the subbands may correspond, for 



15 example, to the bandwidths of the critical ba ^4^<?^ 
£ human ear. The transmitter further comprises^e an ^ f o ^t 

r block-by-block quantitation of the respective subband 



signals. These quant i^al*©** means are shown in the block 
bearing the reference numeral 9 in Fig. 4. 
20 Such a subband coder is known p_er se and is 

described inter alia in the aforementioned publications by 

Krasner and by Theile et ^ ^^^^^MJoxta^^ 
£ published European Patent l&VS&XoW^oio^mr^^^ - 

For a further description of the operation of the 
25 subband coder reference is made to said publications. These 
publications are therefore incorporated herewith by 
reference. Such a subband coder enables a significant data 
reduction to be achieved, for example a reduction from 16 
bits per sample for the wide-band digital signal S BB to for 
3 0 example 4 bits per sample in the signal which is 

transmitted to the receiver 5 via the transmission 
YU medium 4, see Fig. 4. Above n s is assumed to be 384^ 
5 This means that there are blocks of 384 samples of the 

t wide-band digital signal, each sample having a length of 16^ 
bits. Now it is also assumed that M = 32. Consequently, the 
/S wide-band digital signal is split into 32 subband signals 
I in the analysis filter means 3. Now .32_ (blocks of) subband 
5 signals appear on the 32 outputs of the analysis filter 
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means, each block comprising 12 samples (the subbands have 
5 equal width) and each sample having a length of 16 bits. 
This means that on the outputs of the filter means 3 the 
information content is still equal to the information 
content of the block of 384 samples of the signal S BB on 
the input 2. The means 9 now provide data reduction in 
that, using the knowledge about masking, the samples in the 
32 blocks of 12 samples, each block for one subband, are 
quantised more roughly and can thus be represented by a 
smaller number of bits. In the case of a static bit 
allocation all the samples per subband per frame are 
expressed in a fixed number of bits. This number can be 
different for two or more subbands but it can also be equal 
ft for the subbands, for example equal to 4 bits. In the case 
of dynamic bit allocation the number of bits selected for 
every subband may differ viewed in time, so that sometimes 
an even larger data reduction or a higher quality with the 
same bit rate can be achieved. 

The subband signals quantised in the block 9 are 
applied to a generator unit 6. Starting from the quantised 
subband signals this unit 6 generates the second digital 
signal as illustrated in Figs. 1 and 2 . This second digital 
signal, as stated hereinbefore, can be transmitted directly 
via the medium. However-, preferably this second digital 
signal is first adapted to be transmitted via the 
transmission medium 4 in a signal converter (not shown) . 
<£> Such a signal converter comprises, for example, an 8-to-10 
L converter. Such an 8-to-10 converter is described in, for 
3 example , the .APPlican^gu^ 
30 f? 150,082 '4^^£^C^2 ^^e^^S^erts 8-bit data 
\\ t> words into 10-bit data words. Moreover, such a signal 
' converter enables an interleaving process to be applied. 

The purpose of all this is to enable an error correction to 
be performed on the information to be received at the 
35 receiving side. 
O It is obvious that the signal received from the 

r transmission medium 4 by the receiver 5 should then be de- 

interleaved and subjected to a 10-to-8 conversion. 
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^ /t *^^^ e 1 ^ oinposit i on and content of the frames will now 
P be explained in more detail. The first frame portion FD1 in 
' Fig. 2 is shown in greater detail in Fig. 3. Fig. 3 clearly 

£ shows that the first frame portion now comprises exactly 32 
5 bits and is therefore exactly equal to one information 
packet, namely the first information packet IP1 of the 
$ frame. The first 16 bits of the information packet form the 
synchronising signal (or synchronising word) . The 
synchronising signal may comprise for example only "ones". 
1<$ The bits 16 to 31 represent the system information. The 

i_ bits 16 to 23 represent the number of information packets 
JlO in a frame. This number consequently corresponds to P' , 
f both for the frame comprising P' information packets and 
for frames comprising the additional information packet IP 
£ 15^° P'+l. P' can be 254 (1111 1110 in bit notation) at the most 
in order to avoid resemblance to the synchronising signal, 
t The bits 24 to 31 provide frame format information. Fig. 7 

gives an example of the arrangement and significance of 
this information. Bit 24 indicates the type of frame. In 
2 0 the case of format A the second frame portion has another 
length (a different number of information packets) than in 
the case of format B. As will become apparent hereinafter, 
the second frame portion FD2 in the A format comprises 8 
information packets, namely the information packets IP2 to 
2 5 IP9 inclusive and in the B format it comprises 4 

information packets, namely the information packets IP2 to 
IP5 inclusive. The bits 25 and 26 indicate whether copying 
of the information is allowed. The bits 27 to 31 indicate 
the function mode. This means: 
^30 a) the channel mode, which indicates the type of wide-band 
signal (as stated hereinbefore this may be a stereo 
audio signal, a mono audio signal, or an audio signal 
comprising two different signal components for example 
representing the same text but in two different 
35 languages). Fig. 8 represents the channel mode. It 

illustrates how the signal components are divided 
between the two channels (channel I and channel II) in 
the aforementioned cases. 
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^ J b) the sample frequency F s of the wide-band signal. 
P c) the emphasis which may be applied to the wide-band 

T & digital signal in the transmitter. The values 50 and 15 

M s are the time constants of the emphasis and CCITT J. 
5$ The value 17 indicates a specific emphasis standard as 

OJ^^ defined by the CCITT (Comite Consultative Internationale 

<x ^ de Telegraphie et Telephonie) . 

The content of the frame portion FD2 in Fig. 2 will be 
described in more detail with reference to Figs. 9, 10 and 
11. In the A format the second frame portion contains eight 
information packets. This is because it is assumed that the 
& wide-band digital signal S BB is converted into 32 subband 
signals (for every signal portion of the digital signal 
\l s BB ) . An allocation word having a length of four bits is 
15 assigned to every subband. This yields a total of 64 

allocation words having a length of 4 bits each, which can 
be accommodated exactly in eight information packets. 
In the B format the second frame portion accommodates the 
allocation information for only half the number of 
subbands, so that now the second frame portion comprises 
only 4 information packets^Fig. 9 illustrates the 
significance of the four-bit allocation words AW. 
An allocation word associated with a specific subband 
specifies the number of bits by which the samples of the 
25 subband signal in the relevant subband are represented 
after quantisation in the unit 9. For example: the 
£ allocation word AW which is 0100 indicates that the samples 

are represented by 5-bit words. Moreover, it follows from 
|? Fig. 9 that the allocation word 0000 indicates that no 
3 0 samples have been generated in the relevant subband. 

This may happen, for example, if the subband signal in an 
adjacent subband has such a large amplitude that this 
signal fully masks the subband signal in the relevant 
subband. Moreover, the allocation word 1111 is not used 
35 because it bears much resemblance to the sync word in the 
first information packet IPirVig.10 indicates the 
sequence, in the case that the frame mode is A, in which 
the allocation words AW, j,m associated with the two 

A 
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channels j , were j=I or II, and the 32 subbands of the 
sequence number m, m ranging from 1 to 32, are arranged in 
' the second frame portion. The allocation word AWI,1 
belonging to the first subband signal component of the 
first and lowest subband (channel I, subband 1) is inserted 
first. After this the allocation word AWII,1 belonging to 
the second subband-signal component of the first and 
lowest subband (channel II, subband 1) is inserted in the 
second frame portion FD2. Subsequently, the allocation word 
AWI,2 belonging to the first subband-signal component of 
the second and lowest but one subband (channel I, subband 
2) is inserted in the frame portion FD2. This is followed 
by the allocation word AW 11,2 belonging to the second 
subband-signal component of the second subband (channel II, 
subband 2) . This continues until the allocation word AW 
11,4 belonging to the second subband-signal component of 
the fourth subband (channel II, subband 4) is inserted in 
the second frame portion FD2. The second information packet 
IP2 (slot 2) of the frame, which is the first information 
packet in the frame portion FD2 of the frame, is then 
filled exactly. Subsequently, the information packet IP3 
(slot 3) is filled with AW 1,5; AW 11,5; ... AW 11,8. 
This continues in the sequence as illustrated in Fig. 10. 
#Fig. 10 merely gives the indices j-m of the inserted 

allocation word AW, j ,m. Fig. 11 indicates the sequence for 
the allocation words in the case of a B-format frame, 
in this case only allocation words of the subbands 1 to 16 
are inserted. The sequence, as is illustrated in Fig. 10, 
corresponds to the sequence in which the separate samples 
belonging to a channel j and a subband m are applied to the 
synthesis filter means upon reception in the receiver. 
This will be explained in greater detail hereinafter. 
The serial data stream contains for example only frames in 
conformity with the A format. In the receiver the 
allocation information in each frame is then employed for 
correctly deriving the samples from the information in the 
third frame portion of said frame. However, the serial data 
stream may also comprise, more or less alternately, both 
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frames in conformity with the A format and frames in 
conformity with the B format. However, the frames in 
conformity with both formats may contain samples for all 
channels and all subbands in the third frame portion. A 
5 frame in conformity with the B format then lacks in fact 
the allocation information required to derive the samples 
for the channels I or II of the subbands^l? to 32 from the 
£ third frame portion of a B format frame. The receiver 

comprises a memory in which the allocation information 
10 included in the second frame portion of an A format frame 
can be stored. If the next frame is a B format frame only 
the allocation information for the subbands 1 to 16 and the 
channels I and II in the memory is replaced by the 
allocation information included in the second frame portion 
t 15 of the B format f rame^^d--£or--4e r ivi ng - tho samples for the 
subbands 17 to 32. from the third frame portion of the B 
£ format f rane^Sfc42^S4»ff the allocation information for 

these subbands derived from the preceding A format frame 
and still present in the memory. The reason for the 
alternate use of A format frames and B format frames is 
that for some subbands the allocation information fin the 
present case the allocation information for the higher 
£ subbands 17 to 32), does not change rapidly. Since during 

quantisation knows the allocation information for the 
25 various subbands is available in the transmitter, this 

transmitter can decide to generate a B format frame instead 
of an A format frame if the allocation information for the 
subbands 17 to 32 inclusive does not change 
(significantly). Moreover, this illustrates that now 
30 additional space becomes available for^the inclusion of 
£ samples in the third frame portion FD3 . For a specific 

j^P value of P« the third frame portion of a B format frame is 
four information packets longer than the third frame 
portion of an A format frame. Consequently, this enables 
35 the number of bits by which the samples in the lower 

subbands 1 to 16 are represented to be increased, so that 
for these subbands a higher transmission accuracy can be 
achieved. Moreover, if it is required to quantise the 
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lower subbands more accurately the transmitter can 
automatically opt for the generation of B format frames. 
This may then be at the expense of the accuracy with which 
the higher subbands are quantised. 

The third frame portion FD3 in Fig. 2 contains the 
samples of the quantised subband-signal components for the 
two channels. If the allocation word 0000 is not present in 
the frame portion FD2 fo^n^e- of the subband channels this 
means that in the present example twelve samples are 
inserted in the third frame portion FD3 for each of the 32 
subbands and 2 channels. This means that there are 768 
samples^-fetl^n the transmitter the samples may be 
multiplied by a scale factor prior to their quantisation. 
For each of the subbands and channels the amplitudes of the 
twelve samples are divided by the amplitude of that sample 
of the twelve samples which has the largest amplitude, 
in that case a scale factor should be transmitted for every 
subband and every channel in order to enable the inverse 
operation to be performed upon the samples at the receiving 
end. For this purpose the third frame portion then contains 
scale factors SF j,m, one for each of the quantised 
subband-signal components in the various subbands. 
in the present example, scale factors are represented by 6- 
bit numbers, the most significant bit first, the values 
ranging from 000000 to 111110. The scale factors of the 
subbands to which these are allocated, i.e. whose 
allocation information is non-zero, are transmitted before 
the transmission of the samples begins. This means that the 
scale factors are accommodated in the leading part of the 
frame portion FD3 before the samples. This enables a rapid 
decoding in the receiver 5 to be achieved without the 
necessity of storing all the samples in the receiver, as 
will become apparent hereinafter. A scale factor SF j,m can 
thus represent the value by which the samples of the signal 
in the j-th channel of the m-th subband have been 
multiplied. Conversely^the number one divided by said 
value may be stored as the scale factor so that at the 
receiving end it is not necessary to divide the scale 



p 



PHN 13-241 21 1.5.1990 

: factors before the sample^ are scaled up to correct values. 

For the frame format A the maximum number of scale 
factors is 64. If the allocation word AW j,m for a specific 
£ channel j and a specific subband m has the value 0000, 
5 which means that for this channel and this subband no 

samples are present in the frame portion FD3 , it will not 
be necessary to include a scale factor for this channel and 
this subband. The number of scale factors is then smaller 
than 64. The sequence in which the scale factors SF j,m are 
10 inserted in the third frame portion FD3 is the same as that 
in which the allocation words have been inserted in the 
second frame portion. The sequence is therefore as follows: 
& SF 1,1; SF 11,1; SF 1,2; SF 11,2; SF 1,3; SF 11,3;...^ 

SF 1,32, SF 11,32. 
5 ^If it is not necessary to insert a scale factor the 

sequence will not be complete, the sequence may then be 
for example: 

....SF 1,4; SF 1,5; SF 11,5; SF 11,6; 

In this case the scale factors for the fourth subband of 
channel II and the sixth subband of channel I are not 
inserted. If the frame is a B format frame it may still be 
considered to insert scale factors in the third frame 
portion for all the subbands and all the channels. However, 
this is not necessarily so. In this case it would be 
25 possible to insert scale factors in the third frame portion 
of the frame for the subbands 1 to 16 only. In the 
receiver this requires a memory in which all scale factors 
can be stored at the instant at which a previously arriving 
A format frame is received. Subsequently upon reception of 
3 0 the B format frame only the scale factors for the subbands 
1 to 16 are replaced by the scale factors included in the B 
format frame. The scale factors of the previously received 
A format frame for the subbands 17 to 32 are then used in 
order to restore the samples for these subbands included in 
3 5 the third frame portion of the B format frame to the 
correct scale. 

The samples are inserted in the third frame 
portion FD3 in the same sequence as the allocation words 

H 
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and the scale factors, one sample for every subband of 
every channel in succession. This means: first all the 
first samples for the quantised subband signals for all the 
subbands of both channels, then all the second samples, 
^5 .... etc. The binary representation of the samples is 
arbitrary, the binary word comprising only "ones" 
preferably not being used again. 

The second digital signal generated by the 
transmitter 1 is subsequently applied to a transmission 
10 medium 4 via the output 7, and by means of the transmission 
medium 4 this signal is transferred to the receiver 5. 
The transmission via the transmission medium 4 may be a 
wireless transmission, such as for example a radio 
transmission channel. However, other transmission media are 
15 also possible. In this respect an optical transmission may 
be envisaged, for example via optical fibres or optical 
record carriers, such as Compact-Disc-like media, or a 
transmission by means of magnetic record carriers 
utilising RDAT or SDAT-like recording and reproducing 
20 technologies, for which reference is made to the book 

"The art of digital audio" by J.Watkinson, Focal press, 

r . & London 1988.^ 

3W^1?hT^eceiver 5 comprises a decoder, which decodes 

f the signal encoded in the coder 6 of the transmitter 1 and 
25 converts it into a replica of the wide-band digital signal 
supplied to the output 8. 

Fig. 12 shows a more detailed version of the 
receiver 5 in Fig. 4. The coded signal (the second digital 
signal) is applied to a unit 11 via the terminal 10. 
3 0 The essential information in the incoming signal is 
contained in the scale factors and the samples. 
The remainder of the information in the second digital 
signal is merely required for a "correct bookkeeping", to 
allow a correct decoding. The decoding process is repeated 
35 for every incoming frame. The transmitter first derives the 
synchronising and system information from the frames. 
The unit 19 each time detects the sync words situated in 
the first 16 bits of the first frame portion of every 
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frame. Since the sync words of successive frames are each 
time spaced apart by an integral multiple of P' or P'+l 
informationpackets the sync words can be detected very 
accurately ."fjonce the receiver is in synchronism the sync 
5* word can be detected in the unit 19^i n that in the unit 19 
a time window having, for example, a ^^^^^y^ 
information packet is opened after each^twse. P- information 
packets, so that only that part of the incoming information 
is applied to the sync word detector in the unit 19. If the 
10 sync word is not detected the time window remains open for 
the duration of another information packet because the 
preceding frame may be a frame comprising P'+l information 
packets. From these sync words a PLL in the unit 19 can 
0 derive a clock signal to control the central processing 
15 unit 18 .^t is evident from the above that the receiver 
^ should know how many information packets are contained in 
one frame. For this purpose the system information is 
applied to the switching means 15 via an input of the 
processing unit 18, which switching means are then in the 
20 position shown. The system information can now be stored in 
^ a memory 18a of the processing unit 18. The information 
relating to the number of information packets in a frame 
can be applied to the unit 19 via the control-signal line 
20 to open the time window at the correct instants for 

2 5 sync-word detection. When the system information is 

received the switch 15 is changed over to the lower 
position. The allocation information in the second frame 
portion of the frame can now be stored in the memory 18b. 
^ If the allocation information in the incoming frame does 

3 0 not comprise an allocation word for all the subbands and 

channels this will have become apparent already from the 
detected system information. This may be for example the 
information indicating whether the frame is an A-format or 
a B- format frame. Thus, under the influence of the relevant 
3 5 information contained in the system information the 

processing unit 18 will store the received allocation words 
'""J. at the correct location in the allocation memory 18b. 

^It is obvious that in the present example the allocation 
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memory 18b comprises 64 storage positions. If no scale 
factors are transmitted, the elements bearing the reference 
numerals 11, 12 and 17 may be dispensed with and the 
content of the third frame portion of a frame is applied to 
the synthesis filter means via the input 10, which is 
coupled to the input of said filter means via the 
connection 16. The sequence in which the samples are 
applied to the filter means 21 is the same as the sequence 
in which the filter means 21 process the samples in order 
to reconstruct the wide-band signal. The allocation 
information stored in the memory 18b is required in order 
to divide the serial data stream of samples into individual 
samples in the filter means 21, each sample having the 
correct number of bits. For this purpose the allocation 
information is applied to the filter means 21 via the line 
22. The receiver further comprises a deemphasis unit 23 
which subjects the reconstructed digital signal supplied by 
the filter 21 to deemphasis. For a correct deemphasis the 
relevant information in the bits 24 to 31 of the first 
frame portion should be applied from the memory 18a to the 
deemphasis unit 23 via the line 24. 

If the third frame portion also contains the scale 
factors SF j,m the receiver will comprise the switch 11, 
the memory 12 and the multiplier 17. All the instant at 
which the third frame portion FD3 of a frame arrives the 
switch 11 is in the lower position under the influence of a 
control signal applied by the processing unit 18 via the 
line 13. The scale factors can now be applied to the memory 
12. Under the influence of address signals applied to the 
memory 12 by the processing unit 18 via the line 14 the 
scale factors are stored at the correct locations in the 
memory 12. The memory 12 has 64 locations for the storage 
of the 64 scale factors. Again, when a B- format frame is 
received, the processing unit 18 applies such address 
signals to the memory 12 that only the scale factors for 
the subbands 1 to 16 are overwritten by the scale factors 
in the B-format frame. Subsequently, the switch 11 changes 
over to the shown (upper) position under the influence of 
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the control signal applied via the line 13, so that the 
samples are applied to the multiplier 17. Under the 
influence of the allocation information, which is now 
applied to the multiplier 17 via the line 22, the 
multiplier first derives the individual samples of the 
correct bit length form the serial data stream applied via 
the line 16. Subsequently, the samples are multiplied so as 
>o restore them to the correct values of the samples prior 
to scaling-down in the transmitter. If the scale factors 
stored in the memory 12 are the scale factors by which the 
samples have been scaled down in the transmitter these 
scale factors should first be inverted (one divided by the 
scale factor) and should then be applied to the multiplier 
17. Obviously, it is also possible to invert the scale 
factors upon reception before they are stored in the 
memory 12^Xt the scale factors in the frames are already 
equal to the value by which the samples should be scaled up 
during reception they can be stored directly in the memory 
12 and they can be applied directly to the multiplier 17. 
It is evident that no memory is required to store all these 
samples before the signal processing performed upon the 
samples contained in the frame begins. At the instant at. 
which a sample arrives via the line 16 all the information 
required for processing this sample is already available, 
so that processing can be carried out immediately. 
This entire process is effected under the influence of 
control signals and clock signals applied to^all the parts 
of the transmitter by the processing unit 18. By no means 
all the control signals are shown. This is not necessary 
because the operation of the receiver will be obvious to 
those skilled in the art. Under control of the processing 
unit 18 the multiplier 17 multiplies the samples by the 
appropriate multiplication factors. The samples, which have 
now been restored to the correct amplitude, are applied to 
the reconstruction filter 18 in which the subband signals 
are reconverted to form the wide-band digital signal. 
A further description of the receiver is not necessary 
because such receivers are generally known, see for example 
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the publication "Low bit rate coding of high-quality audio 
signals. An introduction to the MAS CAM system" by 
G. Theile et al in EBU Technical Review, no. 230, 
August 1988. Moreover, it will be evident that if the 
system information is also transmitted the receiver can be 
highly flexible and can correctly decode the signals even 
in the case of second digital signals with different system 

diagrammatical ly yet another 
embodiment of the transmitter, which now takes the form of 
a recording device for recording the wide-band digital 
signal on the record carrier, in the present case a 
magnetic record carrier 25. The encoder 6 supplies the 
second digital signal to a recording device 27 comprising a 
15 write head 26 by means of which the signal is recorded in a 
track on the record carrier. It is then possible to record 
the second digital signal in a single track on the record 
carrier, for example by means of a helical-scan recorder, 
in which case the single track is then in fact divided into 
20 juxtaposed tracks which are inclined relative to the 

longitudinal direction of the record carrier. An example of 
this is an RDAT-like recording method. Another method is to 
split the information and simultaneously recording the 
split information in a plurality of juxtaposed tracks which 
25 extend on the record carrier in the longitudinal direction 
of the record carrier. For this the use of an SDAT-like 
recording method may be considered. A comprehensive 
description of the two above methods can be found in the 
aforementioned book "The art of a digital audio" by 
30 J. Watkinson. Again it is to be noted that the signal 

supplied by the unit 6 may be first be encoded in a signal 
£ converter. This encoding may again be an 8-to-10 conversion 
followed by an interleaving process, as described with 
reference to Fig. 4. If the encoded information is recorded 
35 on the record carrier in a plurality of adjacent parallel 
track, this signal converter should also be capable of 
assigning the encoded information to the various tracks. 

Fig. 14 shows diagrammatically an embodiment of 
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the receiver 5, which in the present case takes the form of 
a read device for reading the record carrier 25 on which 
the wide-band digital signal has been recorded in the form 
of the second digital signal by means of the device shown 
5 i n Fig. 13. The second digital signal is read from a track 

on the record carrier by the read head 29 and is applied to 
the receiver 5, which may be for example of a construction 
as shown in Fig. 12. Again the read device 28 may be 
constructed to carry out an RDAT-like or an SDAT like 
10 reproducing method. Both methods are again described 

comprehensively in the aforementioned book by Watkinson. If 
the signal supplied by the unit 6 in the recording device 
^ shown in Fig. 13 has been converted, for example in an 8- 

( .to-10 conversion and in an interleaving step, the encoded 

15 signal read from the record carrier 25 should first be de- ^> 
interleaved and should be subjected to 10-t0-8 conversion. 
Moreover, if the encoded signal has been recorded in a 
plurality of parallel tracks the reproducing unit shown in 
Fig. 14 should arrange the information read from these 
20 tracks in the correct sequence before further processing is 

applied. -i-/r£^ 
f jjtyk/tfli Fig*, ls^show^a number of other possibilities of 

' inserting the scale factors and the samples in the third 

^\ frame portion FD3 of a frame. Fig. 15a illustrates the 

2 5^ above described method in which the scale factors SF for 

all the subbands m and channels (I or II) are inserted in 
^ the third frame portion before the samples. Fig. 15b 

illustrates the same situation as Fig. 15a, but in this 
case it diagrammatically represents the storage capacity 

3 0 for the scale factors SF I,m and SF II, m and the associated 

x samples for these two channels in the subband m. Fig. 15b 
shows the samples for the two channels in the subband m 
combined to blocks, whereas normally they are distributed 
within the third frame portion. The samples have a length 
t 3 5^ of y bits, in the above example x is 12 and y is now taken 
L- "j^ to be 8. Fig.l5c shows another format. The two scale 

factors for the first and the second channel in the subband 
are still present in the third frame portion. However, 
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instead of the x samples for both channels (the left and 
right channels for a stereo signal) in the subband m (i.e. 
2x samples in total) only x samples for the subband m are 
included in the third frame portion. These x samples are 
obtained, for example, by adding corresponding samples in 
each of the two channels to one another. In fact, a 
monophonic signal is obtained in this subband m. The x 
samples in Fig. 15c each have a length of z bits. If z is 
equal to y this saves room in the third frame portion, 
which can be used for samples requiring a more accurate 
quantisation. It is alternatively possible to express the x 
samples of the mono signal in Z - 2y (=16) bits. Such, a 
signal processing is applied if the phase difference 
between the left-hand and the right-hand signal component 
in a subband is irrelevant but the waveform of the 
monophonic signal is important. This applies in particular 
to signals in higher subbands because the phase-sensitivity 
of the ear for the frequency in these subbands is smaller. 
By expressing the x samples of the mono signal in 16 bits 
the waveform is quantised more accurately, while the room 
occupied by these samples in the third frame portion is 
equal to that in the example illustrated in Fig. 15b. 
Yet another possibility is to represent the samples in 
Fig. 15 by for example JJ2_ bits. The signal definition is 
then more accurate than in the example illustrated in 
Fig. 15b whilst in addition room is saved in the third 
frame portion. When at the receiving end the signals 
included in the third frame portion as illustrated in 
Fig.l5c are reproduced a stereo effect is obtained which is 
referred to as "intensity stereo". Here, only the 
intensities of the left-channel and the right-channel 
signals (in the subband m) can differ because of a 
different value for the scale factors SF I,m and SF II, m. 

Fig. 15d gives still another possibility. In this 
case there is only one scale factor SFm for both signal 
components in the subband m. This is a situation which may 
occur in particular for low- frequency subbands. Yet another 
possibility, which is not shown, is that the x samples for 
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the channels I and II of the subband m, as in Fig. 15b, do 
not have associated scale factors SF I,m and SF II, m. 
Consequently, these scale factors are not inserted in the 
same third frame portion. In this case the scale factors 
SF I,m and SF II, m included in the third frame portion of a 
preceding frame must be used for scaling up the samples in 
the receiver. 

All the possibilities described with reference to Fig. 15 
can be employed in the transmitter in order to achieve a 
10 most efficient data transfer via the transmission medium. 
Thus, frames as described with reference to Fig. 15 may 
occur alternately in the data stream. It will be 
appreciated that, if the receiver should yet be capable of 
correctly decoding these different frames, information 
about the structure of these frames should be included in 
the system 4.nf ormation. 

ita^^F^lS shows the transmitter in more detail. 
The Figure shows how the various items of information can 
be combined to form the serial data stream as given in 
Figs.l, 2 and 3. Fig. 16 in fact shows a more detailed 
version of the encoder 6 in the transmitter 1. The encoder 
comprises a central processing unit 30, which controls a 
number of devices in the encoder. The encoder comprises a 
generator 31 included in the processing unit 30 for 
25 generating the synchronising information and the system 
information, as described with reference to Fig. 3, 
a generator 32 for defining the allocation information, 
a generator 33 (optional) for determining the scale 
factors, a generator 34 for defining the samples for a 
3 0 frame. The generator 35 is a generator which is capable of 
generating the additional information packet IP P'+l. 
The outputs of these generators are coupled to associated 
inputs of switching means 40 in the form of a five- C? 
position switch whose output is coupled to the output 7 of 
35 the encoder 6. The switching means 40 are also controlled 
by the processing unit 30. The various generators are 
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controlled via the lines 41.1 to 41.4. The operation of the 
transmitter will be described for a mono signal divided 
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into M subband signals. These M subband signals S SB1 to 
SsBM are applied to the terminals 45.1, 45.2, 45. M. 

For example, blocks of 12 samples of each of the subband 
signals are taken together. In the unit 46.1 to 46. M, if 
present, the twelve samples in a block are scaled to the 
amplitude of the largest sample in the block. The M scale 
factors are applied to the unit 33 (if present) via the 
lines 47.1 to 47. M. The subband signals are applied both to 
the M quantisers 48.1 to 48. M and to a unit 49. For every 
subband the unit 49 defines the number of bits with which 
the relevant subband signal should be quantised. This 
information is applied to the respective quantisers 48.1 to 
48. M via the lines 50.1 to 50. M, so that these quantisers 
correctly quantise the 12 samples of each of the subband 
15 signals. Moreover this (allocation) information is applied 
to the unit 32. The samples of the quantised subband 
signals are applied to the unit 34 via the lines 51.1 to 
51. M. The units 32, 3 3 and 34 arrange the allocation 
information, the scale factors and the samples in the 
correct sequence i.e. in the sequence as described 
hereinbefore. Moreover, the processing unit 30 has 
generated the synchronising information and the system 
information associated with the frame to be generated, in 
which said information stored in the units 32, 33 and 34 
should be inserted .^In the shown position of the switching 
means 40 the synchronising and system information for a 
frame is supplied by the generator 31 and fed to the 
output 7. Subsequently, the switch 40 is set to the second 
position from the top under the influence of the control 
signal supplied by the CPU 30, via the line 53 so that the 
output of the generator 32 is coupled to the output 7. 
Now the allocation information is applied to the output 7 
by the generator 32. The sequence of the allocation 
information is as described with reference to Fig. 10 or 
35 11. After this the switch 40 is set to the third position 
from the top. This means that the output of the generator 
33 is coupled to the output 7. The generator 33 now 
supplies the scale factors in the correct sequence to the 
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output 7. The switch 40 is now set to the next position, so 
that the output of the generator 34 is coupled to the 
output 7. Now the generator 34 supplies the samples in the 
various subbands in the correct sequence to the output 7. 
In this cycle exactly one frame is applied to the output 7. 
Subsequently, the switch 40 is reset to the top position. 
A new cycle is started, in which a subsequent block of 12 
samples for each subband is encoded and a subsequent frame 
can be generated on the output 7. In some cases, for 
example if the sample frequency F s is 44 . 1 kHz, see Fig. 5, 
an additional information packet (the dummy slot, see 
Fig. 2) must be added. In that case the switch will be set 
from the position in which the generator 34 is coupled to 
the bottom position. The output of the generator 35 is now 
coupled to the output 7. Now the generator 35 generates the 
additional information packet IP P'+l, which is applied to 
the output 7. After this the switch 40 is reset to the top 
position to start the next cycle. It is obvious that, if 
the signal received by the transmitter is to be corrected 
for errors caused during transmission of the signal, a 
specific channel coding should be applied to the second 
digital signal. In addition, it is required to modulate the 
second digital signal prior to transmission of the second 
signal. Thus, a digital signal is transmitted via the 
transmission medium, which signal may not be directly 
identifiable as the second signal but which has been 
derived therefrom. Further, it is to be noted that, for 
example in the case that the subbands have different 
widths, the number of samples for the various subbands 
inserted in one third frame portion may differ and are 
likely to differ. It is assumed, for example, that a 
division into three subbands is used, a lower subband SB lt 
a central subband SB 2 and a upper subband SB 3 . The upper 
subband SB 3 will have a bandwidth which is, for example, 
twice as large as that of the other two subbands. 
This means that the number of samples inserted in the third 
4\ frame portion for the subband SB 3 is also twice as large as 
for each of the other subbands. The sequence in which the 
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samples are applied to the reconstruction filter in the 
^ receiver may then be: the first sample of SB lf the first 
sample of SB 3 , the first sample of SB 2 , the second sample 
of SB 3 , the second sample of SB^ the third sample of SB 3 , 

the second sample of SB 2 , the fourth sample of SB 3 , 

etc. The sequence in which the allocation information for 
these subbands is then inserted in the second frame 
"H, portion is now: first the allocation word for SB lf then the 
H allocation word of SB 3 , subsequently the allocation word 
L for SB 2 . The same applies to the scale factors. Moreover, 
the receiver can derive from the system information that in 

tthis case the cycle comprises groups of four samples each, 
each group comprising one sample of SB lf one sample of SB 3 , 
one .sample of SB 2 and subsequently another sample of SB 3 . 
^ ®****" ^Ftg^W^s^other structure of the first 
f frame portion FD1. Again the first frame portion FD1 

contains exactly 32 bits and therefore corresponds to one 
information packet. The first 16 bits again constitute the 
synchronising signal (or synchronisation word) . The 
20 synchronisation word may again be the same as the 

synchronisation word of the first frame portion FD1 in Fig. 
3. The information accommodated in bits 16 through 31 
differs from the information in bits 16 through 31 in Fig. 
H 3. The bits b 16 through b 19 represent the bit rate index 
•25 (BR index). The bit rate index is a_4-bit number whose 

meaning is illustrated in the Table in Fig. 18. If the bit 
I ^ rate index is equal to the 4-bit digital number '0000- this 

' denotes the free-format condition, which means that the bit 

rate is not specified and that the decoder has to depend 
30 upon the synchronisation word alone to detect the 
\ X- ^ beginning of a new frame. The 4-bit digital number •1111- is 
' not employed in order not to disturb the synchronisation 

word detection. In the second column of the Table in Fig. 
18 the bit rate index is represented as a decimal number 
35 A corresponding to the 4-bit digital number. The 

corresponding bit rate values are given in column 1. 

The bits 20 and 21 represent the sample frequency 
F s , see Fig. 18. 
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$ Fig. 18 shows the four possible^g^bit digital 

-H numbers for the bits b 2Q and b 21 and the associated sample 
frequency. Bit 22 indicates whether the frame comprises a 
\ % LJL dummy slot, in which case b 22 = '1', or does not comprise a 
* i- | b U dummy slot, in which case *>„-'<>•. The information in the 
'> ' / (_ b its b through b 22 makes it possible to determine how 

many information packets are actually present in the frame. 
This means again that the first frame portion contains 
information related to the number of information packets in 
10 the frame. As n s is known, which is the number of samples 
of the wide-band signal whose corresponding information 
belonging to the second digital signal is accommodated in 
one frame, in the present example n s = 384, it is possible 
to determine how many information packets B are present in 
the frame by means of the data in the Table in Fig. 8, the 
padding bit b 22 and the formula 

BR n s 
P = — x — • 
N F s 

The bit b 23 is intended for specifying a future extension 
of the system. This future extension will be described 
hereinafter. For the time being this bit is assumed to be 
6 1,3- '0'. The content of the first frame portion, as regards the 
'V\ bits b 24 through b 31 , will be described with reference to 
25 O Figs. 19 and 20. The bits b 24 and b 25 give the mode 
indication for the audio signal. 

For the four possibilities of this two-bit digital 
number Fig. 20 shows whether the wide-band digital signal 
£ I X is a stereo audio signal ('00'), a mono signal ('11'), a 
V?3d|, X bilingual signal ('10'), or an intensity stereo audio 
^ \X signal ('01'). In the last-mentioned case the bits 26 and 
1 27 indicate which subbands have been processed in 

accordance with the intensity stereo method. Figure 20 
K) , indicates for the respective two-bit numbers "00 '01', 

£ 35 U ,10 '' and 'lil«^A the subbands 5-32, 9-32, 13-32 and 17-32 
' ^ ha^Fe been processed in accordance with the intensity stereo 

method. As stated hereinbefore intensity stereo can be 
applied to the higher subbands because the ear is less 
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phase-sensitive for the frequencies in these subbands. The 
W bit b. a can be used as a copyright bit. If this bit is 
BU* ii'«*LjWfcns that the information is copy-protected and 
H should/cannot be copied. The bit t> 29 can indicate that the 

X information is original information (b 29 = for 
1 example in the case of the prerecorded tapes, or 
Ml X information which has been copied (b 29 = '0'). The bits b 3Q 
' and b specify the emphasis which may have been applied to 

the wide-band signal in the transmitter, see also the 
10 description with reference to Fig. 7. 

Hereinafter, another configuration of the second 
frame portion FD2 will be described for the various mode 
\\ indications represented by the bits b 24 through b 2? in the 

first frame portion. Again the second frame portion 
15$ comprises the 4-bit allocation words whose meaning has been 
described with reference to Fig. 9. For the stereo mode 
U (b 24 , b 25 = 00) and the bilingual mode (b 24 , b 25 = 10) the 

second frame portion FD2 again has a length of 8 
information packets (slots) and is composed as described 
\ JL 2 0 with reference to Fig. 10. In the stereo mode -II' in Fig. 

10 then represents, for example, the left-channel component 
\>X and 'II' the right channel component. For the bilingual 
\ y mode 'I' denotes one language and 'II' denotes the other 
' language. For the mono mode <b 24 , t> 25 = ll)Jtte length of 

25 & the second frame portion FD2 is of course only 4 

information packets (slots). Fig. 21 illustrates the 
sequence of the allocation words for the various subbands 1 
through 32 in the four information packets (slots) 2 
through 5. Thus, every quantity M-i represents a four-bit 
30 allocation word which specifies the number of bits in every 
sample in the subband of the sequence number i, i ranging 
from 1 to 32. In the intensity stereo mode (b 24 , b 25 = _0l) 
there are four possibilities indicated by means of the bits 
W b, 6 and b 2? , see Fig. 20. All these possibilities result in 
35 a different content of the second frame portion FD2 . 
^ Figs. 22a to 22d illustrate the four different 

^ contents of the second frame portion. If the switch bits 
\\« 9f A b 26' b 27 are ^fci^tignals ^ the subbands 1 through 4 
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are normal stereo signals and the signals in the subbands 5 
through 32 are intensity-stereo signals. This means that 
for the subbands 1 through 4 for the left-hand and right- O 
hand channel components in these subbands the associated 
5 allocation words should be stored in the second frame 

portion, in Fig. 22a this is represented by the consecutive 
allocation words AW (1, 1); AW <R, 1); AW (1, 2); AW (R, 
2); ... AW (R, 4), stored in the slot 2 of the frame, i.e. 
the first slot of the second frame portion. Fig. 22a only 
10 gives the indices (i-j) of the allocation words, i being 

equal to L or R and indicating the left-hand and the right- ^ 
hand channel component respectively, and j ranging from 1 
through 4 and representing the sequence number of the 
subband. For the subbands 5 through 32 the left-hand and 
15 the right-hand channel components contain the same series 
of samples. The only difference resides in the scale 
factors for the left-hand and the right-hand channel 
components in a subband. Consequently, such a subband 
requires only one allocation word. The allocation words AW 
20 (i, j) for these subbands 5 through 32 are indicated by the 
indices M-j, where i is consequently equal to M for all the 
subbands and where j ranges from 5 through 32. 
-Tgl U Fig. 22a shows that J_l/2 information packets are 

^ ' required for inserting the 36 allocation words in the 
25U second frame portion. If the switch bits b 2g , b 2? are 
\ . 0 l'4*e*signals in the subbands 1 through 8 will be normal 

stereo signals and the signals in the subbands 9 through 32 
will be intensity-stereo signals. This means that for each 
of the subbands 1 through 8 two allocation words AW(L, j) 
30 and AW(R,j) are required and that for each of the subbands 
9 through 32 only one allocation word AW(M,j) is required. 
This implies that in total 40 allocation words are needed, 
included in five information packets (slots), i.e. IP2 
-^y through IP6, of the frame. This is illustrated in Fig. 22b. 
35' in this case the second frame portion FD2 has a length of 
five information packets (slots) . i ^ 
\ If the switch bits b 26 , b 27 are -lO^fefee signals xn 

' the subbands 1 through 12 will be normal stereo signals and 
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the signals in the subbands 13 through 32 will be 
< intensity-stereo signals. Fig. 22c gives the structure of 
the second frame portion FD2 with the allocation words for 
the various subbands. The second frame portion now has a 
length of 5 1/2 information packets (slots) in order to 
accommodate all the allocation words. If the switch bits 
\1 HfO'tX, b 26 , b 2? are • 11 '^tSe^signals in the subbands 1 through 16 
will be normal stereo signals and the signals in the 
subbands 17 through 32 will be intensity-stereo signals. 
10 Now 48 allocation words are needed, which are inserted in 
the second frame portion, which then has a length of 6 
information packets (slots), see Fig. 22d. 

What has been stated above about the scale factors 
is also valid here. When it is assumed that an allocation 
fi> word 0000 has been assigned neither to any of the subbands 
nor to any of the channels, 64 scale factors are required 
both for the stereo mode and for the intensity-stereo 
modes. This is because in all the intensity-stereo modes 
every mono subband should have two scale factors to enable 
intensity-stereo to be realised for the left-hand and the 
^ right-hand channel in this subband, see Fig. 15c. 

It is obvious that in the mono mode the number of 
scale factors is halved, i.e. 32, again assuming that the 
allocation word 0000 has not been assigned to any of the 

25 sl } bb 5 nd li*l6>n i$tiXk***UiaZc*r? 
£ U *C ^ -4^^ J Amethod of determining the 6-bit scale factors 

I w ni now be explained below. As stated hereinbefore, the 

sample having the largest absolute value is determined for 
every 12 samples of a subband channel. 
^ Fig. 24a shows this maximal sample l s max l' Tne 

6 ) 3- first bit, designated SGN, is the sign bit and is '0' 
^ because it relates to the absolute value of S ma}£ . The 
3 samples are represented in two's complement notation. The 
1 ^ # sample comprises k 'zeros' followed by a "1". The values of 
35 the other bits of the 24-bit digital number are not 
relevant and can be either '0'or '1'. 
Cp* VvT |S | is now multiplied by 2 k , see Fig. 24b. 

subsequently |S max |.2 k is compared with a digital number 
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DV X equal to 010100001100000000000000 and a digital number 
DV 2 equal to 011001100000000000000000. If I s m ax ' * 2 < D ^l 2 
specific constant p is taken to be 2 . If DV 1 < |S max |.2 < 
/? DV 2 , then p is taken to be 1. If |S max |.2 k > DV 2 , then p=0. 
I The number k is limited to 0 < k < 20. The scale factor is 
now determined by the numbers k and p in accordance with 
the following formula. 
~f T SF = 3k + p. 

Consequently, the maximum value for SF is 62. This 
10 means that the scale factors can be represented by 6-bit 
# numbers, the six-bit number 111111 (which corresponds to 
the decimal number 63) not being used. In fact, the 6-bit 
binary numbers are not the scale factors but they are in a 
uniquely defined relationship with the actual scale 
factors, as will be set forth below. All the 12 samples S 
are now multiplied by a number which is related to the 
values for k and p. The 12 samples are each multiplied as 
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S' = Sx2 k xg(p) 



The parameter k specifies the number of 6 dB steps and the 
factors (g(l) and g(2) are the closest approximations to 
steps of 2 dB. The samples S' thus scaled are now quantised 
to enable them to be represented by q-bit digital numbers 
3> in two's complement notation. In Fig. 25 t^is is 
$ 30 illustrated for q = The scaled samples have values 
& 51 £ between +„l__and -l^see Fig. 25a. In the quantiser these 

samples must be represented by q bits, q corresponding to 
the allocation value for the relevant subband (channel) . 
Since, as stated above, the q-bit digital number comprising 
£ only 'ones' j^not used to represent a sample the total 

^ \\ interval from -1 to +1 should be divided over 2 q_1 smaller 
^A-* intervals. For this purpose the scaled samples S' ( a*e- 

transformed into the samples S • • j^accordance with the 
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follows. pj 



where the number g(p) has the following relation with p: Y> 
g(p) =1 for p = 0 

g(p) = l +2 - 2 + 2- 8 + 2- 10 + 2- 16 + 2- 18 + 2- 23 for_ ?5 = J 16 _ 1? 

_1 -a -6 -8 -9 -10 -13+2 +2 +2 + x 
g(P> = 1±J 9 1+ ?. 20 +2 6 + 2 8 + 2 9 + 2 +2 _A 
2 +2 for p = 2. 
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formula S" = S • (1-2"*) -2"^ 
£ The samples S ' } ( ai?e A subsequently truncated at q 

I M bits, see Fig. 25c. Since the ^111' representation is not 

' ^ permissible the sign bits are inverted, see Fig. 25d. The 

% 5^T q(=34-=bit numbers given in Fig. 25d are now inserted in the 
third frame portion FD3, see Fig. 2. 
)h Samples S' which comply with -0.71<S'<-0.14 are 

represented by the digital number '001' . This proceeds 
jj^ similarly for samples S' of larger values up to samples 
1^3 103) which comply with 0.71 < S'<1 and which are represented by 
£ the digital number "110 Consequently, the digital number 

g i ^ •111' is not used. 

Dequanti^ation at the receiving side is effected in 
a manner inverse to the quantization at the transmission 
15 side, see Fig. 26. This means that first the sign bits of 

the q-bit digital numbers are inverted to obtain the normal 
V X two's complement notation, see Fig. 26b. 

/ Subsequently the samples S ' are derived from the 

transformed samples S • ' by means of the formula 
V20 S' = <S"+ 2-* +1 ><l + 2-1 + 2" 2 * + 2" 3q + 2- 4 * + ...), see 
^IT Figs. 26c and 26d. 

4-> The values S« thus obtained are now situated exactly within 

the original intervals in Fig. 25a. At the receiving side 
4_.r5 the samples S« are subsequently scaled to the original 
25 amplitudes by means of the transmitted information k, p 
which is related to the scale factors. Thus, at the 
receiving side a number g' (p) complies with: 
g' (p) = 1 for p = 0 

g.(p) = 2-1 + 2" 2 + 2-5 + 2-* for p - 1 
g«(p) = 2 _1 + 2~ 3 + 2"° + 2 for p = 2. 

Scaling to the original amplitudes is now effected using 
the following formula: 

fT ? | f? s = S'.2- k .g» (P) . 

In the two possible versions of a frame as 
3 5 described with reference to Figures 2 and 3 and Figures 2, 
17 and 19 respectively the third frame portion may not be 
filled entirely with information. This will occur more 
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often and sooner as the algorithms for subband coding, i.e. 
the entire process of dividing the signal into subband 
signals and the subsequent quantization of the samples in 
the various subbands, are improved. In particular, this 
5 will enable the information to be transmitted with a 

smaller number of bits (average number per sample) . The 
unused part of the third frame portion can then be utilized 
for transmitting additional information. In the first frame 
portion FD1 in Fig. 17 allowance has been made for this by 
10 means of the "future-use" bit b 23 - Normally, this bit is 
^ 'yV '0', as will be apparent from Fig. 18. 

If an additional signal has been inserted in the 
<H third frame portion FD3 of a frame, the future-use bit b 23 
# l y ;_ in the first frame portion FD1, see Fig. 17, will be '1'. 
15 During reading of the first frame portion FD1 this makes it 
possible for the receiver to detect the frame contains 
additional information. The allocation information and the 
scale factors, see Fig. 23, inform the receiver that only 
the part of the third frame portion FD3, marked FD4 in Fig. 
23, contains quantised samples of the subband signals. The 
remainder, marked FD5 in Fig. 23, now contains the 
additional information. The first bits in this frame 
portion FD5 are designated •EXT INFO* or extension 
information. These bits indicate the type of additional 
2 5 information. The additional information may be, for 

example, an additional audio channel, for example for the 
transmission of a second stereo channel. Another 
possibility is to use these two additional audio channels 
1 ^ to realise 'surround sound' together with the audio subband 
1 30 signals in the frame portion FD4. In that case the front- r^s 
rear information required for surround sound may be 
included in the frame portion FD5. In the part marked FD6 
the frame portion FD5 may again contain allocation 
information, scale factors and samples (in this order) and 
35 the sequence of the allocation words and the scale factors 
may then be similar to the sequence as described with 
reference to Figs. 2 and 3 and Figs. 2, 17 and 19. 

In the case of 'surround sound' simple receivers 
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may merely decode the stereo audio information in the frame 
portions FD2 and FD3 , except for the frame portion FD5. 
More sophisticated receivers are then capable of 
reproducing the surround-sound information and for this 
purpose they also employ the information in the frame 
portion FD5. 

The extension-info bits may also indicate that the 
information in the frame portion FD6 relates to text, for 
example in the form of ASCII characters. It may even be 
considered to insert video or picture information in the 
frame portion FD6 , said information again being 
characterized by the extension-info bits. 

It is to be noted that the invention is not limited 
to the embodiments shown herein. The invention also relates 
to those embodiments which differ from the embodiments 
shown herein with respect to features which are not 
relevant to the invention as defined in the Claims. 



